Blind Recovery of Perceptual Models in Distributed Speech and Audio Coding

نویسندگان

  • Tomas Bäckström
  • Florin Ghido
  • Johannes Fischer
چکیده

A central part of speech and audio codecs are their perceptual models, which describe the relative perceptual importance of errors in different elements of the signal representation. In practice, the perceptual models consists of signal-dependent weighting factors which are used in quantization of each element. For optimal performance, we would like to use the same perceptual model at the decoder. While the perceptual model is signal-dependent, however, it is not known in advance at the decoder, whereby audio codecs generally transmit this model explicitly, at the cost of increased bit-consumption. In this work we present an alternative method which recovers the perceptual model at the decoder from the transmitted signal without any side-information. The approach will be especially useful in distributed sensor-networks and the Internet of things, where the added cost on bit-consumption from transmitting a perceptual model increases with the number of sensors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Blind Tamper Detection in Audio using Chirp based Robust Watermarking

In this paper, we propose the use of ‘chirp coding’ for embedding a watermark in audio data without generating any perceptual degradation of audio quality. A binary sequence (the watermark) is derived using energy based features from the audio signal and chirp coding used to embed the watermark in audio data. The chirp coding technique is such that the same watermark can be derived from the ori...

متن کامل

L2 Learners’ Lexical Inferencing: Perceptual Learning Style Preferences, Strategy Use, Density of Text, and Parts of Speech as Possible Predictors

This study was intended first to categorize the L2 learners in terms of their learning style preferences and second to investigate if their learning preferences are related to lexical inferencing. Moreover, strategies used for lexical inferencing and text related issues of text density and parts of speech were studied to determine their moderating effects and the best predictors of lexical infe...

متن کامل

A warped linear-prediction-based subband audio coding algorithm

In this paper, a novel audio coding algorithm is proposed where the warped linear prediction (WLP) technique is employed to construct a perceptual preand post-filter for subband audio coding. A modified signal-to-mask ratio (SMR) calculation is given for subband coding of the WLP residuals of audio signals. The concept of perceptual entropy (PE) is extended to subband coding, resulting in the s...

متن کامل

A Qualitative Meta-analysis of Perceptual-motor Problems in Visually Impaired People

Introduction: Perceptual motor activities improve motor skills and learning. These skills play an effective role in receiving, interpreting and responding to the sensory stimuli. This study aimed to identify perceptual-motor problems in visually impaired people. Methods: This qualitative research was conducted using a research synthesis method. Therefore, the analysis unit consisted of all the...

متن کامل

Wideband Speech Recovery Using Psychoacoustic Criteria

Manymodern speech bandwidth extension techniques predict the high-frequency band based on features extracted from the lower band.While this method works for certain types of speech, problems arise when the correlation between the low and the high bands is not sufficient for adequate prediction. These situations require that additional high-band information is sent to the decoder. This overhead ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016